Abstract: Opportunistic scheduling and beamforming 
schemes have been proposed previously by the authors for 
reduced-feedback MIMO-OFDMA downlink systems where the 
MIMO channel of each subcarrier is decomposed into layered 
spatial subchannels. It has been demonstrated that significant 
feedback reduction can be achieved by returning information 
about only one beamforming matrix (BFM) for all subcarriers 
from each MT, compared to one BFM for each subcarrier in the 
conventional schemes. However, since the previously proposed 
channel decomposition was derived based on singular value 
decomposition, the resulting system performance is impaired by 
the subchannels associated with the smallest singular values. 
To circumvent this obstacle, this work proposes improved 
opportunistic scheduling and beamforming schemes based on 
geometric mean decomposition (GMD) of the MIMO channel. 
In addition to the inherent advantage in reduced feedback, the 
proposed schemes can achieve improved system performance by 
decomposing the MIMO channels into spatial subchannels with 
more evenly distributed channel gains. Numerical results confirm 
the effectiveness of the proposed opportunistic scheduling and 
beamforming schemes. 

I. Introduction 

Orthogonal frequency-division multiple-access (OFDMA) 
has recently emerged as one of the most promising multi- 
plexing techniques for the next-generation broadband wireless 
communication systems. In an OFDMA downlink system, the 
base station (BS) simultaneously transmits data to several 
active mobile terminals (MTs) by modulating each MT's data 
on an exclusive set of orthogonal subcarriers. In addition to 
its robustness to multipath fading and high spectral efficiency, 
OFDMA is particularly attractive due to its flexibility in 
dynamically allocating subcarriers to different MTs based 
on their different quality of service (QoS) requirements and 
channel conditions [1]. Furthermore, the recent advances 
in multiple-input multiple-output (MIMO) techniques have 
inspired considerable research interest in MIMO-OFDMA. 
However, perfect channel state information (CSI) is generally 
required at the BS in order to fully harvest the advantages 
provided by MIMO in the downlink transmission, which 
incurs a formidable amount of feedback from MTs to the 
BS. To circumvent this obstacle, opportunistic scheduling and 
beamforming schemes were proposed for single-carrier (SC) 
systems by exploiting multiuser diversity with limited channel 
feedback [2], [3]. However, little work has been done on 
developing opportunistic scheduling schemes for OFDMA. 
Unlike single-carrier systems, the amount of feedback over- 
head required in OFDMA grows at a rate proportional to the 
number of subcarriers, which makes it challenging to design 
feedback-limited scheduling schemes for OFDMA. 

Assuming approximately identical channel fading over ad- 
jacent subcarriers, the pioneering work of [4] devised an 
opportunistic scheduling scheme for single-input single-output 
(SISO)-OFDMA by grouping adjacent subcarriers into exclu- 
sive clusters and returning only the average signal-to-noise 
ratio (SNR) of each cluster. The resulting feedback overhead 
becomes proportional to the number of clusters, rather than 
the number of subcarriers. However, such feedback reduction 
is achieved at the price of performance degradation since 
an increasing cluster size leads to non-negligible channel 
variations over subcarriers within each cluster. Recently, [5] 
proposed to resolve this problem by performing beamforming 
jointly by the BS and MTs. More specifically, the BS employs 
the same beamforming matrix (BFM) for all subcarriers while 
each MT completes the beamforming task for each subcarrier 
locally. As a result, for a MIMO-OFDMA system with Q 
subcarriers, [5] requires feedback of one BFM index and Q 
SNRs from each MT. However, singular value decomposition 
(SVD) is employed in [5] to decompose the MIMO channel of 
each subcarrier into layered spatial subchannels. As a result, 
the overall bit error rate (BER) performance of the system is 
usually dictated by the spatial subchannels associated with the 
smallest singular values. 

In this work, we propose improved opportunistic schedul- 
ing and beamforming schemes for MIMO-OFDMA downlink 
systems using geometric mean decomposition (GMD). Since 
direct power allocation among layered spatial subchannels is 
analytically challenging, we propose a GMD-based approach 
to decompose the MIMO channels into layered spatial sub- 
channels with more evenly distributed signal gains. Since 
the same BFM is applicable to all subcarriers, the proposed 
schemes can substantially reduce the required feedback over- 
head in a fashion similar to [5]. Analytical and simulation 
results are presented to demonstrate the effectiveness of the 
proposed schemes. 

Notation: Vectors and matrices are denoted by boldface 
letters. $\|\cdot\|$ represents the Euclidean norm of the enclosed 
vector and $|\cdot|$ denotes the amplitude of the enclosed complex-valued 
quantity. $\mathbf{I}_N$ is the $N \times N$ identity matrix and $\det(\cdot)$ 
is the determinant of the enclosed matrix. We use $E\{\cdot\}$ and 
$(\cdot)^H$ for expectation and Hermitian transposition, respectively. 
Finally, $[\mathbf{A}]_{i,j}$ denotes the $i$-th row and $j$-th column entry of 
matrix $\mathbf{A}$ whereas $[\mathbf{a}]_i$ denotes the $i$-th entry of vector $\mathbf{a}$. 

II. Signal Model and Background 

We consider a MIMO-OFDMA downlink system with $Q$ 
subcarriers and $K$ active MTs. The BS and each MT are 
equipped with $M$ and $N$ antennas, respectively, with $M \le N$. 
In this work, we consider a homogeneous network in 
which the propagation path between each pair of transmit 
and receive antennas undergoes independent and identically 
distributed (i.i.d.) slow frequency-selective fading with a maximum 
channel length $L$. We denote by $\mathbf{h}_k^{m,n}(p)$ the $k$-th MT's 
channel impulse response between its $n$-th receive antenna 
and the $m$-th transmit antenna during the $p$-th OFDMA block. 
Assuming that the cyclic prefix (CP) is sufficiently long and 
synchronization has been achieved, the signal received by the 
$k$-th MT over its $q$-th subcarrier during the $p$-th OFDMA block 
can be written as 



$$\mathbf{y}_{k,q}(p) = \mathbf{G}_{k,q}(p)\,\mathbf{x}_{k,q}(p) + \mathbf{w}_{k,q}(p), \qquad (1)$$



where $\mathbf{x}_{k,q}(p)$ is the pre-coded data vector and $\mathbf{w}_{k,q}(p)$ is 
a zero-mean circularly symmetric white Gaussian noise vector with 
unit variance. Furthermore, $\mathbf{G}_{k,q}(p)$ is the corresponding 
frequency-domain channel matrix computed as [6] 



$$\mathbf{G}_{k,q}(p) = \ldots \qquad (2)$$

with $\mathbf{e}_i$ being the vector containing the first $L$ elements of 
the $i$-th column of a $Q$-point discrete Fourier transform (DFT) 
matrix $\mathbf{W}$ with $[\mathbf{W}]_{\ell,u} = \exp(\ldots)$ for $0 \le \ell, u \le Q-1$. 
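
As a small illustration of how a frequency-domain channel coefficient arises from an $L$-tap impulse response, the sketch below evaluates one scalar entry of the channel matrix on a given subcarrier. The function name `subcarrier_gain` and the DFT kernel $\exp(-j 2\pi \ell q / Q)$ are assumptions made for this example only; other sign and normalization conventions are equally possible.

```python
import cmath

def subcarrier_gain(h, q, Q):
    """Frequency response of the tap vector h on subcarrier q of a Q-point system.

    Assumes the kernel exp(-j*2*pi*l*q/Q); this convention is an assumption.
    """
    if len(h) > Q:
        raise ValueError("channel length L must not exceed Q")
    return sum(tap * cmath.exp(-2j * cmath.pi * l * q / Q)
               for l, tap in enumerate(h))

# A single-tap channel is flat over all subcarriers, whereas the two-tap
# channel [1, 1] has a spectral null at q = Q/2.
flat = [subcarrier_gain([1.0], q, 8) for q in range(8)]
null = subcarrier_gain([1.0, 1.0], 4, 8)
```

Stacking such coefficients over the $N$ receive and $M$ transmit antennas would give the channel matrix of one subcarrier.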

In the sequel, we focus on the $k$-th MT over the $p$-th OFDMA 
block and omit the MT and block indices, i.e. $k$ and $p$, for 
presentational clarity. 

Before proceeding to the proposed schemes, we will first 
review two straightforward extensions of [3] that were orig- 
inally developed for SC-MIMO systems. By regarding each 
subcarrier of OFDMA as an individual system, the MIMO 
channel of the q-th subcarrier can be decomposed into the 
following form: 



$$\mathbf{G}_q = \mathbf{U}_q \boldsymbol{\Sigma}_q \mathbf{V}_q^H, \qquad (3)$$



where $\mathbf{U}_q$ and $\mathbf{V}_q$ are $N \times N$ and $M \times M$ unitary matrices, 
respectively. Furthermore, $\boldsymbol{\Sigma}_q$ is a diagonal matrix whose first 
$M$ diagonal elements are the singular values of $\mathbf{G}_q$, $\{\lambda_{q,i}\}$ for 
$i = 1, 2, \cdots, M$. As shown in [3], $\mathbf{V}_q$ can be employed as the 
BFM for the $q$-th subcarrier whereas $\mathbf{U}_q$ can be employed for 
data detection at the chosen MT. However, since $\mathbf{V}_q$ contains 
the right singular vectors of $\mathbf{G}_q$ and depends on the subcarrier 
index $q$, information about all $Q$ BFMs as well as the real-valued 
$\{\lambda_{q,i}\}$ of all $Q$ subcarriers has to be fed back from each MT to the BS. 



In the sequel, the opportunistic scheduling and beamforming 
scheme using (3) is referred to as the per-subcarrier eigen-beamforming 
scheme (PS-EB). 

Inspired by [4], the feedback overhead of PS-EB can be 
reduced by dividing the Q subcarriers into G exclusive clusters 
with Q = G x U and returning information about only one 
BFM and the corresponding average SNR of each cluster. As 
a result, the feedback overhead is reduced by a factor of U . 
However, since $\mathbf{V}_n \neq \mathbf{V}_m$ for $n \neq m$ due to channel variations 
over subcarriers, using the same BFM for all subcarriers within 
one cluster leads to system performance degradation. In the 
sequel, this scheme is referred to as the per-cluster eigen- 
beamforming scheme (PC-EB). 

III. Per-subcarrier GMD-based scheme (PS-GMD) 

A. MIMO Channel Decomposition 

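Rather than directly decomposing $\mathbf{G}_q$ as in (3), we first rewrite 
$\mathbf{G}_q$ into the following form: 

$$\begin{aligned}
\mathbf{G}_q &= \mathbf{W}_q
\begin{bmatrix}
\mathbf{h}^{1,1} & \mathbf{h}^{2,1} & \cdots & \mathbf{h}^{M,1} \\
\mathbf{h}^{1,2} & \mathbf{h}^{2,2} & \cdots & \mathbf{h}^{M,2} \\
\vdots & \vdots & \ddots & \vdots \\
\mathbf{h}^{1,N} & \mathbf{h}^{2,N} & \cdots & \mathbf{h}^{M,N}
\end{bmatrix} && (4) \\
&= \mathbf{W}_q \mathbf{H}, && (5)
\end{aligned}$$

where $\mathbf{W}_q = [\mathbf{I}_N \otimes \mathbf{e}_q^H]$ with $\otimes$ denoting the Kronecker 
product. 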

Next, $\mathbf{H}$ is decomposed using GMD [7] as follows: 

$$\mathbf{H} = \mathbf{B}\,\mathbf{E}\,\mathbf{P}^H, \qquad (6)$$

where $\mathbf{B}$ and $\mathbf{P}$ are $NL \times NL$ and $M \times M$ unitary matrices, 
respectively, while $\mathbf{E}$ is a real-valued upper triangular matrix 
with its diagonal elements all equal to the geometric mean of the 
positive singular values of $\mathbf{H}$. 
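
To make the equal-diagonal property concrete, the following sketch carries out a GMD for the two-stream case, starting from a diagonal matrix of two singular values. It uses a closed-form pair of rotations for the $2 \times 2$ case, which is an assumption of this example and not necessarily the algorithm of [7]; the names `gmd_2x2`, `matmul` and `transpose` are hypothetical helpers. Given an SVD $\mathbf{H} = \mathbf{U}\boldsymbol{\Sigma}\mathbf{V}^H$ with two non-zero singular values, the rotations would be absorbed into the unitary factors on either side.

```python
import math

def matmul(a, b):
    """Plain-Python matrix product of two nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def gmd_2x2(s1, s2):
    """GMD of diag(s1, s2) with s1 >= s2 > 0.

    Returns (G1, E, G2) such that diag(s1, s2) = G1 * E * G2^T, where G1 and
    G2 are real rotations and E is upper triangular with equal diagonal entries.
    """
    if not s1 >= s2 > 0:
        raise ValueError("expected s1 >= s2 > 0")
    sbar = math.sqrt(s1 * s2)            # geometric mean of the singular values
    if s1 == s2:
        c, s = 1.0, 0.0                  # already balanced, no rotation needed
    else:
        c = math.sqrt((sbar ** 2 - s2 ** 2) / (s1 ** 2 - s2 ** 2))
        s = math.sqrt(1.0 - c * c)
    g2 = [[c, -s], [s, c]]
    g1 = [[c * s1 / sbar, -s * s2 / sbar],
          [s * s2 / sbar, c * s1 / sbar]]
    e = matmul(matmul(transpose(g1), [[s1, 0.0], [0.0, s2]]), g2)
    return g1, e, g2

g1, e, g2 = gmd_2x2(4.0, 1.0)
# e is approximately [[2, -3], [0, 2]]: both diagonal gains equal sqrt(4 * 1) = 2.
```

In this toy case the weakest subchannel gain rises from 1 to 2, at the price of an off-diagonal interference term that the receiver has to cancel.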
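Substituting (6) into (5), we have 

$$\begin{aligned}
\mathbf{G}_q &= \mathbf{W}_q \mathbf{B}\,\mathbf{E}\,\mathbf{P}^H && (7) \\
&= \mathbf{Q}_q \mathbf{R}_q \mathbf{P}^H, && (8)
\end{aligned}$$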



where the last equality is obtained by the following QR 
decomposition: 

$$\mathbf{W}_q \mathbf{B}\,\mathbf{E} = \mathbf{Q}_q \mathbf{R}_q, \qquad (9)$$

with $\mathbf{Q}_q$ being an $N \times N$ unitary matrix and $\mathbf{R}_q$ an $N \times M$ 
upper triangular matrix. 

Inspection of (8) suggests that $\mathbf{P}$ is independent of the 
subcarrier index $q$ and thus can be employed as the BFM at 
the BS for all subcarriers, i.e. 

$$\mathbf{x}_q = \mathbf{P}\,\mathbf{s}_q, \qquad (10)$$

where $\mathbf{s}_q$ of length $M$ contains the data symbols with 

$$E\{\|\mathbf{s}_q\|^2\} = \rho M, \qquad (11)$$

and $\rho$ being the average SNR. 

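Furthermore, (8) indicates that the beamforming operation 
for each subcarrier is characterized by $\mathbf{Q}_q$, which is known to 
the MT. Recalling (1), (8) and (10), data detection at each MT 
can be carried out by pre-multiplying $\mathbf{y}_q$ with $\mathbf{Q}_q^H$: 

$$\begin{aligned}
\mathbf{Q}_q^H \mathbf{y}_q &= \mathbf{Q}_q^H \left( \mathbf{Q}_q \mathbf{R}_q \mathbf{P}^H \mathbf{P}\,\mathbf{s}_q + \mathbf{w}_q \right) && (12) \\
&= \mathbf{R}_q \mathbf{P}^H \mathbf{P}\,\mathbf{s}_q + \mathbf{w}'_q && (13) \\
&= \mathbf{R}_q \mathbf{s}_q + \mathbf{w}'_q, && (14)
\end{aligned}$$

where we have exploited the fact that $\mathbf{Q}_q$ and $\mathbf{P}$ are unitary 
matrices and $\mathbf{w}'_q = \mathbf{Q}_q^H \mathbf{w}_q$. 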

Since $\mathbf{R}_q$ is an upper triangular matrix also known to the 
MT, $\mathbf{s}_q$ can be easily detected from (14) using existing data 
detectors such as the QRD-M receiver [8]. In the sequel, this 
scheme is referred to as the per-subcarrier GMD-based scheme 
(PS-GMD). 
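
The triangular structure of (14) is what makes layer-by-layer detection possible. The sketch below is a deliberately simple stand-in for the QRD-M receiver of [8]: it keeps a single hard decision per layer (plain successive interference cancellation by back-substitution) and assumes unit-energy QPSK symbols. The function names `qpsk_slice` and `sic_detect` and the example matrix are hypothetical.

```python
import math

def qpsk_slice(z):
    """Map a complex value to the nearest unit-energy QPSK point."""
    re = 1.0 if z.real >= 0 else -1.0
    im = 1.0 if z.imag >= 0 else -1.0
    return complex(re, im) / math.sqrt(2)

def sic_detect(R, r):
    """Detect s from r = R s + w for upper triangular R by back-substitution."""
    M = len(r)
    s_hat = [0j] * M
    for m in reversed(range(M)):         # the last layer is interference-free
        interference = sum(R[m][j] * s_hat[j] for j in range(m + 1, M))
        s_hat[m] = qpsk_slice((r[m] - interference) / R[m][m])
    return s_hat

R = [[2.0, -3.0], [0.0, 2.0]]            # example upper triangular matrix
s = [complex(1, 1) / math.sqrt(2), complex(-1, 1) / math.sqrt(2)]
r = [sum(R[i][j] * s[j] for j in range(2)) for i in range(2)]  # noiseless case
s_hat = sic_detect(R, r)
```

A QRD-M detector would instead retain several candidate symbol sequences per layer, which reduces error propagation at the cost of more computation.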

B. Capacity analysis 

It has been shown in [7] that the adverse effect of error 
propagation in decoding (14) is negligible in the high-SNR 
region. As a result, (14) can be rewritten as $M$ equivalent 
parallel channels: 

$$[\mathbf{Q}_q^H \mathbf{y}_q]_m = [\mathbf{R}_q]_{m,m}\,[\mathbf{s}_q]_m + [\mathbf{w}'_q]_m, \quad m = 1, 2, \cdots, M. \qquad (15)$$



Thus, the supportable data throughput on the $q$-th subcarrier 
is given by 

$$C_q = \sum_{m=1}^{M} \log_2\left(1 + \rho \left|[\mathbf{R}_q]_{m,m}\right|^2\right). \qquad (16)$$

It is interesting to compare the capacity of PS-GMD and PS-EB. 
Recalling that $\{\lambda_{q,m}\}$ are the singular values of the MIMO 
channel over the $q$-th subcarrier, the corresponding supportable 
data throughput provided by PS-EB takes the following form: 

$$T_q = \sum_{m=1}^{M} \log_2\left(1 + \rho\,|\lambda_{q,m}|^2\right). \qquad (17)$$

Observing that (8) and (3) stand for two different decomposition 
expressions of the same matrix $\mathbf{G}_q$, we have 

$$\mathbf{Q}_q \mathbf{R}_q \mathbf{P}^H = \mathbf{U}_q \boldsymbol{\Sigma}_q \mathbf{V}_q^H. \qquad (18)$$

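Upon taking the determinant of both sides and recalling that 
$\mathbf{Q}_q$, $\mathbf{P}$, $\mathbf{U}_q$ and $\mathbf{V}_q$ are all unitary matrices, we can easily show 
that 

$$\det(\mathbf{R}_q) = \det(\boldsymbol{\Sigma}_q), \qquad (19)$$

or equivalently, 

$$\prod_{m=1}^{M} \left|[\mathbf{R}_q]_{m,m}\right| = \prod_{m=1}^{M} |\lambda_{q,m}|. \qquad (20)$$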
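
The step from (18) to (20) can be spelled out as follows. This is a sketch that assumes $M = N$, so that all determinants are defined, and that works with magnitudes, since the determinant of a unitary matrix has unit modulus and the determinant of a triangular matrix is the product of its diagonal entries:

```latex
\begin{aligned}
\left|\det\!\left(\mathbf{Q}_q \mathbf{R}_q \mathbf{P}^H\right)\right|
  &= |\det(\mathbf{Q}_q)|\,|\det(\mathbf{R}_q)|\,|\det(\mathbf{P}^H)|
   = |\det(\mathbf{R}_q)|
   = \prod_{m=1}^{M} \left|[\mathbf{R}_q]_{m,m}\right|, \\
\left|\det\!\left(\mathbf{U}_q \boldsymbol{\Sigma}_q \mathbf{V}_q^H\right)\right|
  &= |\det(\boldsymbol{\Sigma}_q)|
   = \prod_{m=1}^{M} |\lambda_{q,m}| .
\end{aligned}
```

Equating the two lines, which describe the same matrix $\mathbf{G}_q$, gives (20).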

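Assuming $\rho\,|\lambda_{q,m}|^2 \gg 1$ and $\rho \left|[\mathbf{R}_q]_{m,m}\right|^2 \gg 1$, we can 
obtain the following asymptotic relationship between the two 
data throughputs as $\rho$ increases to infinity: 

$$\lim_{\rho \to \infty} \frac{C_q}{T_q} = \lim_{\rho \to \infty} \frac{M \log_2(\rho) + 2 \log_2\left(\prod_{m=1}^{M} \left|[\mathbf{R}_q]_{m,m}\right|\right)}{M \log_2(\rho) + 2 \log_2\left(\prod_{m=1}^{M} |\lambda_{q,m}|\right)} = 1. \qquad (21)$$

Thus, the data throughput of PS-GMD approaches that of PS-EB 
asymptotically as $\rho$ increases. 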
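
The asymptotic equivalence can be checked numerically with a short sketch. The gains below are hypothetical: two singular values (4 and 1) and two equalized diagonal gains (2 and 2) that share the same product, as required by (20). The helper name `throughput` is an assumption of this example.

```python
import math

def throughput(gains, rho):
    """Sum of log2(1 + rho*|g|^2) over the spatial subchannels, as in (16), (17)."""
    return sum(math.log2(1.0 + rho * abs(g) ** 2) for g in gains)

gmd_gains = [2.0, 2.0]   # hypothetical |[R_q]_{m,m}|, equal on both layers
svd_gains = [4.0, 1.0]   # hypothetical singular values with the same product

# Ratio C_q / T_q for increasing SNR values (linear scale).
ratios = {rho: throughput(gmd_gains, rho) / throughput(svd_gains, rho)
          for rho in (1.0, 1e2, 1e4, 1e6)}
```

With these numbers the ratio stays below one at low SNR and tends to one as $\rho$ grows, in line with (21).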

IV. Per-cluster GMD-based scheme (PC-GMD) 

Further feedback reduction can be achieved by dividing 
the $Q$ subcarriers into $G$ exclusive clusters composed of $U$ 
adjacent subcarriers with $Q = G \times U$. Assuming that the 
channel conditions are approximately identical over the $U$ 
subcarriers in the same cluster, the average supportable data 
throughput of each cluster, $C_g$, is computed and fed back to 
the BS with 

$$C_g = \frac{1}{U} \sum_{q \in I_g} C_q, \qquad (22)$$

where $I_g = \{i_{g,1}, i_{g,2}, \cdots, i_{g,U}\}$ is the subcarrier index set of 
the $g$-th cluster. 
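
A minimal sketch of the averaging in (22) is given below, assuming that cluster $g$ consists of $U$ consecutive subcarriers and that the per-subcarrier throughputs are held in a list. The function name `cluster_throughputs` is hypothetical.

```python
def cluster_throughputs(per_subcarrier, U):
    """Average the per-subcarrier throughputs over clusters of U adjacent subcarriers."""
    Q = len(per_subcarrier)
    if U <= 0 or Q % U != 0:
        raise ValueError("Q must be a positive multiple of U")
    return [sum(per_subcarrier[g * U:(g + 1) * U]) / U for g in range(Q // U)]

# Q = 4 subcarriers grouped into G = 2 clusters of U = 2 subcarriers each.
averages = cluster_throughputs([1.0, 3.0, 5.0, 7.0], 2)
```

Only the resulting $G$ averages, rather than $Q$ individual values, would then be reported to the BS.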

As a result, only information about one BFM and G 
real-valued supportable throughputs are required to be fed 
back to the BS. In the sequel, this scheme is referred to as 
the per-cluster reduced-feedback opportunistic scheduling and 
beamforming scheme (PC-GMD). 
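
The feedback requirements of the four schemes discussed so far can be tallied in a few lines. The sketch follows the counting used in the text, with one BFM and one real-valued quantity per subcarrier or cluster; the function name `feedback_items` and this way of tabulating are assumptions of the example.

```python
def feedback_items(scheme, Q, U=1):
    """Return (number of BFMs, number of real-valued quantities) fed back per MT."""
    if U <= 0 or Q % U != 0:
        raise ValueError("Q must be a positive multiple of U")
    G = Q // U                      # number of clusters
    table = {
        "PS-EB": (Q, Q),            # one BFM and one quantity per subcarrier
        "PC-EB": (G, G),            # one BFM and one average SNR per cluster
        "PS-GMD": (1, Q),           # a single BFM plus Q throughputs
        "PC-GMD": (1, G),           # a single BFM plus G throughputs
    }
    return table[scheme]

# Example with Q = 64 subcarriers and clusters of U = 8 subcarriers.
summary = {name: feedback_items(name, 64, 8 if name.startswith("PC") else 1)
           for name in ("PS-EB", "PC-EB", "PS-GMD", "PC-GMD")}
```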

V. Discussion 

A. BFM feedback 

To derive (14), we have assumed that $\mathbf{P}$ is employed as 
the BFM in (10). However, in practice, the BS and MTs share a 
common BFM codebook with a finite number of unitary matrices. 
As a result, the MT can only select the unitary matrix that best 
matches $\mathbf{P}$ and feed back the index of the chosen BFM to the 
BS. Denote by $2^B$ the number of entries in the codebook. 
A viable criterion to select the best BFM, $\mathbf{P}_{d^*}$, from the 
codebook is given by 

$$d^* = \arg\min_{d \in [1, 2^B]} \ldots \qquad (23)$$



Clearly, there exists a tradeoff between codebook size and 
feedback overhead. On the one hand, it is desirable to possess 
a large codebook such that the chance of achieving $\mathbf{P}^H \mathbf{P}_{d^*} = \mathbf{I}_M$ 
is improved. On the other hand, a large codebook requires 
a large $B$, which in turn incurs more feedback bits. The impact 
of the codebook size, determined by $B$, on the system performance will be 
investigated in the next section through computer simulation. 
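
The codebook search can be sketched as follows. The selection metric used here, the squared Frobenius norm of $\mathbf{P}^H \mathbf{P}_d - \mathbf{I}_M$, is an assumption chosen because it vanishes exactly when $\mathbf{P}^H \mathbf{P}_d = \mathbf{I}_M$; the toy codebook of real $2 \times 2$ rotations and all function names are likewise hypothetical. Indices are 0-based in the code.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def hermitian(a):
    """Conjugate transpose of a nested-list matrix."""
    return [[a[j][i].conjugate() for j in range(len(a))] for i in range(len(a[0]))]

def mismatch(P, Pd):
    """Squared Frobenius norm of P^H Pd - I (assumed selection metric)."""
    prod = matmul(hermitian(P), Pd)
    n = len(prod)
    return sum(abs(prod[i][j] - (1.0 if i == j else 0.0)) ** 2
               for i in range(n) for j in range(n))

def select_bfm(P, codebook):
    """Return the 0-based index of the codebook entry that best matches P."""
    return min(range(len(codebook)), key=lambda d: mismatch(P, codebook[d]))

def rotation(theta):
    return [[math.cos(theta), -math.sin(theta)],
            [math.sin(theta), math.cos(theta)]]

B = 2                                                          # feedback bits
codebook = [rotation(d * math.pi / 8) for d in range(2 ** B)]  # toy codebook
best = select_bfm(rotation(0.8), codebook)                     # index to feed back
```

Only the $B$-bit index `best` would be returned to the BS, which then looks up the same entry in its own copy of the codebook.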

B. Comparison with [5] 

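It is instructive to compare the proposed channel decomposition 
technique shown in (8) with PS-QRD proposed in [5]. 
Recall from [5] that SVD is first applied on $\mathbf{H} = \mathbf{U} \boldsymbol{\Sigma} \mathbf{V}^H$, 
followed by decomposition of $\mathbf{W}_q \mathbf{U} \boldsymbol{\Sigma}$ into the following 
QR form: 

$$\mathbf{W}_q \mathbf{U} \boldsymbol{\Sigma} = \tilde{\mathbf{Q}}_q \tilde{\mathbf{R}}_q, \qquad (24)$$

where $\mathbf{U}$ and $\tilde{\mathbf{Q}}_q$ are unitary matrices while $\boldsymbol{\Sigma}$ and $\tilde{\mathbf{R}}_q$ are 
diagonal and upper triangular matrices, respectively. 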



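Letting $\tilde{\mathbf{A}}_q = \tilde{\mathbf{Q}}_q^H \mathbf{W}_q \mathbf{U}$, (24) can be rewritten as 

$$\tilde{\mathbf{R}}_q = \tilde{\mathbf{A}}_q \boldsymbol{\Sigma}. \qquad (25)$$

Similarly, letting $\mathbf{A}_q = \mathbf{Q}_q^H \mathbf{W}_q \mathbf{B}$, we can rewrite $\mathbf{R}_q$ 
in (9) as 

$$\mathbf{R}_q = \mathbf{A}_q \mathbf{E}. \qquad (26)$$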
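Since $\mathbf{G}_q = \mathbf{Q}_q \mathbf{R}_q \mathbf{P}^H = \tilde{\mathbf{Q}}_q \tilde{\mathbf{R}}_q \mathbf{V}^H$, we have 

$$\det(\mathbf{G}_q) = \det(\mathbf{R}_q) = \det(\tilde{\mathbf{R}}_q), \qquad (27)$$

and subsequently, 

$$\prod_{m=1}^{M} [\mathbf{R}_q]_{m,m} = \prod_{m=1}^{M} [\tilde{\mathbf{R}}_q]_{m,m}, \qquad (28)$$

where we have exploited the fact that $\mathbf{Q}_q$, $\mathbf{P}$, $\tilde{\mathbf{Q}}_q$ and $\mathbf{V}$ 
are unitary matrices while $\mathbf{R}_q$ and $\tilde{\mathbf{R}}_q$ are upper triangular 
matrices. 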

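Furthermore, it is easy to show that $\mathbf{A}_q$ and $\tilde{\mathbf{A}}_q$ are also 
upper triangular matrices. Thus, we have 

$$[\tilde{\mathbf{R}}_q]_{m,m} = [\tilde{\mathbf{A}}_q]_{m,m}\,[\boldsymbol{\Sigma}]_{m,m} \qquad (29)$$

and 

$$[\mathbf{R}_q]_{m,m} = [\mathbf{A}_q]_{m,m}\,[\mathbf{E}]_{m,m} \qquad (30)$$

for $m = 1, 2, \cdots, M$. 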

Since $\mathbf{A}_q$ and $\tilde{\mathbf{A}}_q$ are products of the same matrix $\mathbf{W}_q$ 
with two different unitary matrices, it is reasonable to argue 
that their diagonal elements follow the same distribution and 
have approximately the same values. However, in contrast to 
$\boldsymbol{\Sigma}$ with different singular values on its diagonal, $\mathbf{E}$ has identical 
diagonal elements. As a result, we argue that it is more likely 
for $\left|[\mathbf{R}_q]_{m,m}\right|$ to be approximately equal for $m = 1, 2, \cdots, M$. 



C. Comparison with PS/PC-EB 

Compared to PS/PC-EB, the total amount of feedback over- 
head of PS/PC-GMD is reduced by half. More importantly, 
since information about only one BFM is fed back per cluster 
in PC-EB, the chosen BFM may become a poor approximation 
of the true eigen-beamforming matrices for subcarriers in the 
cluster. As a result, severe system degradation is incurred 
as the cluster size increases. In contrast, PC-GMD requires 
feedback about only one BFM for all subcarriers, regardless 
of the cluster size. Consequently, the performance degradation 
due to clustering in PC-GMD is marginal. 

However, it is fair to say that the substantial feedback 
reduction of PS/PC-GMD is achieved at the cost of higher 
computational complexity compared to PS/PC-EB since the 
QRD-M receivers have to be employed to detect the data 
symbols. Furthermore, some BER performance degradation 
may be incurred due to the noisy detection errors. 

VI. Simulation Results 

Computer simulations are performed in this section to con- 
firm the performance of the proposed scheduling and beam- 
forming schemes. The simulated OFDMA system has Q = 64 
subcarriers and M = N = 2 antennas. The channel response 
of each pair of transmit and receive antennas is generated 
according to the channel model specified in [9]. Furthermore, 
the soft-QRD-M receiver proposed in [8] is employed for 



data detection with the QRD-M factor equal to 12 after noise 
whitening. Since we have assumed a homogeneous network, 
the scheduling scheme that allocates the subcarrier/cluster 
to the MT with the highest throughput is fair. Finally, the 
source binary data is coded by a low-density parity-check 
(LDPC) encoder with code rate $R = 7/8$ before being mapped 
to a quadrature phase-shift keying (QPSK) constellation for 
all schemes under investigation. 
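
The scheduling rule mentioned above, which assigns each subcarrier or cluster to the MT reporting the highest throughput, can be sketched as below. The data layout (`feedback[k][g]` for MT `k` and cluster `g`) and the function name `schedule` are assumptions of this example; ties are resolved in favor of the lowest MT index.

```python
def schedule(feedback):
    """Assign each cluster to the MT with the highest reported throughput.

    feedback[k][g] is the throughput reported by MT k for cluster g.
    Returns a list whose g-th entry is the index of the selected MT.
    """
    if not feedback or any(len(row) != len(feedback[0]) for row in feedback):
        raise ValueError("every MT must report one value per cluster")
    num_clusters = len(feedback[0])
    return [max(range(len(feedback)), key=lambda k: feedback[k][g])
            for g in range(num_clusters)]

# Two MTs reporting throughputs for three clusters.
reports = [[1.0, 4.0, 2.0],
           [3.0, 2.0, 2.5]]
assignment = schedule(reports)
```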

Case 1: BER performance comparison 

In the first example, we compare the BER performance of 
PS-GMD, PS-QRD and PS-EB. Fig. 1 shows their coded BER 
performance as a function of $E_b/N_0$. 
Fig. 1. BER performance as a function of SNR with K = 10. 

Inspection of Fig. 1 suggests that PS-QRD and PS-EB have 
approximately the same BER performance for large K. This 
is because their performance is susceptible to the subchannels 
associated with the small singular values. In contrast, PS- 
GMD provides good BER performance by benefiting from the 
subchannels with more evenly distributed channel gains. 

Case 2: Impact of codebook size 

In this example, we investigate the impact of the codebook 
size on coded BER performance. As $B$ increases, the performance 
of PS-GMD improves. Interestingly, Fig. 2 suggests 
that PS-GMD with a smaller codebook outperforms PS-EB 
and PS-QRD with $B = \infty$. 

Fig. 3 shows the coded BER performance of PS-GMD with 
different $K$ and $B$. It is evident from Fig. 3 that the system 
degradation due to a smaller codebook can be alleviated by 
exploiting more multiuser diversity (i.e. more users) with PS- 
GMD. 

Case 3: Impact of cluster size 

In the last example, we compare the throughput performance 
of PC-GMD and PC-EB with different numbers of clusters 
($G = 2, 4, 8, 16, 32$) at an SNR of 10 dB. As $G$ increases, 
more accurate channel information is 
returned to the BS. As a result, the throughput performance 
increases for both schemes. However, for small $G$, channel 
variations within clusters become non-negligible. As a result, 
the BFM chosen by PC-EB to serve the whole cluster becomes 
a poor approximation of the ideal BFM for each subcarrier in 
the cluster, which incurs throughput degradation. In contrast, 
PC-GMD does not entail feedback loss regarding the BFM 
information, which leads to only marginal performance degradation, 
as observed in Fig. 4, even for small $G$. 

Fig. 2. Impact of the codebook size, B, on coded BER performance 

Fig. 3. Impact of K and B on coded BER performance 

Fig. 4. Average throughput as a function of the number of clusters, G. 

VII. Conclusion 

Two opportunistic scheduling and beamforming schemes, 
namely PS-GMD and PC-GMD, have been proposed for 
MIMO-OFDMA downlink systems by exploiting a novel 
beamforming technique where beamforming is performed 
jointly by both the BS and MTs. The resulting PS-GMD 
scheme requires feedback of only one BFM and Q supportable 
throughputs for a MIMO-OFDMA system with Q subcarriers. 
A further feedback reduction is achieved by PC-GMD through 
grouping adjacent subcarriers into clusters and returning only 
cluster information from each MT. Since PC-GMD does not 
incur feedback loss regarding the BFM information, it only 
entails marginal performance degradation with respect to PS- 
GMD. It has been confirmed through asymptotic analysis and 
computer simulation that the proposed schemes can achieve 
good BER and throughput performance with substantially 
reduced feedback. 